Matching Phrases for Arabic-to-English Example-Based Translation System
نویسندگان
چکیده
An implementation of a non-structural Example-Based Machine Translation system that translates sentences from Arabic to English, using a parallel corpus aligned at the paragraph level, is described. Each new input sentence is fragmented into phrases and those phrases are matched to example patterns, using various levels of morphological data. The system has been implemented and automatically evaluated. Results are encouraging.
منابع مشابه
System Description for IWSLT 2010
Our submission is a non-structural Example-Based Machine Translation system that translates text from Arabic to English, using a parallel corpus aligned at the paragraph / sentence level. Each new input sentence is fragmented into phrases and those phrases are matched to example patterns, using various levels of morphological information. Source-language synonyms were derived automatically and ...
متن کاملDescription for IWSLT 2010
Our submission is a non-structural Example-Based Machine Translation system that translates text from Arabic to English, using a parallel corpus aligned at the paragraph / sentence level. Each new input sentence is fragmented into phrases and those phrases are matched to example patterns, using various levels of morphological information. Source-language synonyms were derived automatically and ...
متن کاملModified Dijkstra Like Search Algorithm for English to Arabic Machine Translation System
In this paper we are introducing a modified Dijkstra’s shortest path algorithm used to detect the target language phrases. We list the indexes of the source sentence’s words which were found in the target language corpus and create a directed graph to detect the phrases that form a shortest path walk in the graph. This method is used in a hybrid English to Arabic MT system. The system combines ...
متن کاملTuning a phrase-based statistical translation system for the IWSLT 2005 Chinese to English and Arabic to English tasks
Nowadays, most of the statistical translation systems are based on phrases (i.e. groups of words). We describe a phrase-based system using a modified method for the phrase extraction which deals with larger phrases while keeping a reasonable number of phrases. Also, different alignments to extract phrases are allowed and additional features are used which lead to a clear improvement in the perf...
متن کاملImproving Example Based Machine Translation Through Morphological Generalization and Adaptation
Example Based Machine Translation (EBMT) is limited by the quantity and scope of its training data. Even with a reasonably large corpus, we will not have examples that cover everything we want to translate. This problem is especially severe in Arabic due to its rich morphology. We demonstrate a novel method that exploits the regular nature of Arabic morphology to increase the quality and covera...
متن کامل